Chemically Aware Model Builder (camb): an R package for property and bioactivity modelling of small molecules
نویسندگان
چکیده
BACKGROUND In silico predictive models have proved to be valuable for the optimisation of compound potency, selectivity and safety profiles in the drug discovery process. RESULTS camb is an R package that provides an environment for the rapid generation of quantitative Structure-Property and Structure-Activity models for small molecules (including QSAR, QSPR, QSAM, PCM) and is aimed at both advanced and beginner R users. camb's capabilities include the standardisation of chemical structure representation, computation of 905 one-dimensional and 14 fingerprint type descriptors for small molecules, 8 types of amino acid descriptors, 13 whole protein sequence descriptors, filtering methods for feature selection, generation of predictive models (using an interface to the R package caret), as well as techniques to create model ensembles using techniques from the R package caretEnsemble). Results can be visualised through high-quality, customisable plots (R package ggplot2). CONCLUSIONS Overall, camb constitutes an open-source framework to perform the following steps: (1) compound standardisation, (2) molecular and protein descriptor calculation, (3) descriptor pre-processing and model training, visualisation and validation, and (4) bioactivity/property prediction for new molecules. camb aims to speed model generation, in order to provide reproducibility and tests of robustness. QSPR and proteochemometric case studies are included which demonstrate camb's application.Graphical abstractFrom compounds and data to models: a complete model building workflow in one package.
منابع مشابه
QSPR with ’camb’ Chemically Aware Model Builder
Daniel S. Murrell∗1,5, Isidro Cortes-Ciriano†2,5, Gerard J. P. van Westen, Ian P. Stott, Andreas Bender, Therese E. Malliavin, and Robert C. Glen Unilever Centre for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom. Unite de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3825, Structural Biology and Chemistry Department, 25-2...
متن کاملProteochemometrics (PCM) with ’camb’ Chemistry Aware Model Builder
Chemistry Aware Model Builder Isidro Cortes-Ciriano∗1,5, Daniel S. Murrell†2,5, Gerard J. P. van Westen, Ian P. Stott, Andreas Bender, Therese E. Malliavin, and Robert C. Glen Unite de Bioinformatique Structurale, Institut Pasteur and CNRS UMR 3825, Structural Biology and Chemistry Department, 25-28, rue Dr. Roux, 75 724 Paris, France. Unilever Centre for Molecular Science Informatics, Departme...
متن کاملBayesian molecular design with a chemical language model
The aim of computational molecular design is the identification of promising hypothetical molecules with a predefined set of desired properties. We address the issue of accelerating the material discovery with state-of-the-art machine learning techniques. The method involves two different types of prediction; the forward and backward predictions. The objective of the forward prediction is to cr...
متن کاملP122: Small Molecules as Chemical and Pharmacological Tools for Neuroinflammatory Diseases Treatment (with Emphasis on Multiple Sclerosis)
Multiple Sclerosis (MS) is a neuroinflammatory disease resulting in degeneration of the myelin sheaths and death of oligodendrocytes. So far, several strategies have been introduced to control the disease. Treatment with small molecules is one of the strategies that have recently attracted the attention in the scientific community. These molecules that target epigenetic and other cellular proce...
متن کاملInventory Model for Deteriorating Items Involving Fuzzy with Shortages and Exponential Demand
This paper considers the fuzzy inventory model for deteriorating items for power demand under fully backlogged conditions. We define various factors which are affecting the inventory cost by using the shortage costs. An intention of this paper is to study the inventory modelling through fuzzy environment. Inventory parameters, such as holding cost, shortage cost, purchasing cost and deteriorati...
متن کامل